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Abstract 

This paper makes three main contributions to the theory of communication complexity and stream 
computation. First, we present new bounds on the information complexity of AUGMENTED-INDEX. In 
contrast to analogous results for INDEX by Jain, Radhakrishnan and Sen [J. ACM, 2009], we have to 
overcome the significant technical challenge that protocols for AUGMENTED-INDEX may violate the 
"rectangle property" due to the inherent input sharing. Second, we use these bounds to resolve an open 
problem of Magniez, Mathieu and Nayak [STOC, 2010] that asked about the multi-pass complexity 
of recognizing Dyck languages. This results in a natural separation between the standard multi-pass 
model and the multi-pass model that permits reverse passes. Third, we present the first passive memory 
checkers that verify the interaction transcripts of priority queues, stacks, and double-ended queues. We 
obtain tight upper and lower bounds for these problems, thereby addressing an important sub-class of 
the memory checking framework of Blum et al. [Algorithmica, 1994]. 
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1 Introduction 



In recent work, Magniez, Mathieu and Nayak lfl31l considered the streaming complexity of language recog- 
nition. That is, given a string a of length n, what is the (randomized) space complexity of a recognizer for 
a language L that is allowed only sequential access to er? This question can be viewed as a generalization 
of the classic notion of regularity of languages: one now considers automata that are allowed (1) random- 
ization, and (2) a variable number of states that may depend on the input length. Their main result provided 
near-matching bounds for single-pass recognizers for DYCK(2), the language of properly nested parentheses 
of two kinds. In this paper, we look at the broader question and present the first multi-pass space lower 
bounds for several languages, including DYCK(2), resolving an open question of theirs. We also study the 
complexity of languages that arise in the context of memory checking JH, and present tight upper and lower 
bounds for them. Our key technical contributions rely on a new understanding of the information complexity 
of the augmented index problem, which leads to these multi-pass lower bounds. 

Background, Augmented Index, and a New Lower Bound. The index problem is one of a handful of 
fundamental problems in communication complexity [14]: Alice has a string x G {0, l} n , and Bob has an 
index k G [n]; the players wish to determine the kth bit of x, written as x^. It is easy to show that the 
problem is "hard" — requiring Q(n) communication — when messages only go from Alice to Bob, and is 
"easy" — solvable using O(logn) communication — without this restriction. The lower bound extends to 
randomized constant-error protocols [1J. This makes INDEX the canonical hard-for-one-way, easy-for-two- 
way communication problem. Is there really anything new to say about such a fundamental problem? 

As it turns out, there is, provided one asks the right questions. Since INDEX is an asymmetric problem, it 
makes sense to ask for the best possible tradeoff between the number, a, of bits communicated by Alice, and 
the number, b, communicated by Bob. As shown by Miltersen et al. |[T6l . we must have a > n/2°^ b \ and a 
simple two-round Bob — > Alice — > Bob protocol (with Bob announcing the output) shows that a < \n/2 b ~\ 
is achievable. A more nuanced question asks for the best tradeoff of information revealed by each player to 
the other in a protocol for INDEX, also called the information costs of Alice and Bob (we shall soon formally 
define these). In principle, this tradeoff could have been better, as it is possible for messages to reveal less 
information than their length. This issue was considered (in a more general quantum communication setting) 
by Jain, Radhakrishnan and Sen lfT2l who called this the "privacy tradeoff" for the problem, and showed 
that a > n/2°^ still holds, where a and b now represent information costs. 

Such an information cost tradeoff opens up interesting possibilities for applications to lower bounds 
for more complex problems, via the direct sum properties of this measure H |6j. One such applica- 
tion is the aforementioned DYCK(2) lower bound. However, the tradeoff theorem of Jain et al. is not 
strong enough to obtain the required direct sum result. One needs a tradeoff lower bound in a variant of 
INDEX where Alice and Bob have much more "help," in two ways. First, we relax INDEX so that Bob 
additionally gets to see the length-(/c — 1) prefix of Alice's input; the resulting variant has been called 
AUGMENTED-INDEX iflOl [T3l and the one-way communication lower bound easily extends to it [3]. Sec- 
ond, in our variant of AUGMENTED-INDEX, Bob also gets a check bit c G {0, 1} and must verify that 
Xk = c. This second twist clearly does not matter when considering communication complexity, but for 
us it makes a huge difference, because our applications require that we measure information cost under an 
"easy" distribution, where xu always equals c. 

With this background, we state our main theorem informally. A formal version appears as Theorem [231 
after the necessary definitions. 

Theorem 1.1 (Informal). In a randomized communication protocol for AUGMENTED-INDEX with two-sided 
error at most 1/ log 2 n, either Alice reveals Q(n) information about her input, or Bob reveals infor- 
mation about his, where information is measured according to an "easy " input distribution. 
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The natural point of comparison is a similar theorem of Magniez et al. lfl31 . that works only for restricted 
protocols: Alice must be deterministic (thus, her information cost is just her usual communication cost), 
and the protocol must be two-round with an Alice — >• Bob — > Alice communication pattern. Under these 
conditions, for errors below 0(l/n 2 ), they show that either Alice sends O(n) bits, or else Bob reveals 
SI (log n) information. Notice that this theorem is not quite a special case of ours, because of the higher 
lower bound on Bob's information cost. This is inevitable: for a general communication pattern, one cannot 
obtain a tradeoff that strong, because of the aforementioned a < \n/2 b ~\ upper bound. However, we suspect 
that the optimum tradeoff lower bound is in fact of the form a > n/2 0<yb \ where the O-notation hides factors 
polylogarithmic in b, and we leave this conjectured generalization of our theorem as an open problem. 

Ramifications: Streaming Language Recognition. In the streaming model, we have one-way access to 
input and working memory sublinear in the input size N. Historically, the problems considered in it have 
focused on estimating statistics. Recognizing structural properties of strings is just as natural a problem in 
this model, and yet such language recognition problems have only recently been considered. It transpires 
that AUGMENTED-INDEX has a key role to play in proving bounds here. 

A first application is direct: following lfl31 . a two-step argument shows fl(y/N) lower bounds for the 
multi-pass streaming complexity of Dyck languages. We first plug Theorem 11.11 into a direct sum theo- 
rem, which lower bounds the communication cost of a problem we call MULTI-AI (for "multiple copies of 
AUGMENTED-INDEX"). We then reduce MULTI-AI to, e.g., DYCK(2). The direct sum theorem is a natural 
extension to multiple passes of a similar single-pass theorem ( Ifl5l . where the authors called the relevant 
problems ASCENSION and MOUNTAIN). Thus, on the lower bound side, our chief contribution is Theo- 
rem 11.11 and its most important consequence is the multi-pass nature of the resulting lower bounds. In 
particular, this demonstrates a curious phenomenon: an explicit, natural data stream problem that is fairly 
easy given two passes in opposite directions (Magniez et al. give an 0(log 2 N) -space algorithm), whereas 
it is exponentially harder if only multiple unidirectional passes are allowed. 

A second application is to memory checking, whose study was initiated by Blum et al. [5] and continued 
by numerous groups including Ajtai ||2), Chu et al. JH, Dwork et al. [:TTJ, and Naor and Rothblum [17]. 
The problem, as considered in this paper, is to observe a sequence of N updates and queries to (an imple- 
mentation of) a data structure, and to report whether or not the implementation operated correctly on the 
instance observed. A concrete example is to observe a transcript of operations on a priority queue: we see 
a sequence of insertions intermixed with items claimed to be the results of extractions, and the problem 
is to decide whether this is correct. Much of the previous work allowed the checker to be invasive, by 
modifying the inserted items and/or introducing additional read operations. However, when the checker is 
more realistically restricted to being completely passive, and can only observe, the problem becomes that of 
understanding the (streaming) complexity of recognizing valid transcripts. For instance, we define PQ to be 
the language of valid transcripts of priority queue operations that start and end with an empty queue. One 
can similarly define languages STACK and DEQUE (for double-ended queues). The invasive protocols of 121 
typically modified the input items by attaching a "timestamp" to each inserted item, and this suggests variant 
languages PQ-TS, STACK-TS, and DEQUE-TS, where each extraction is augmented by the timestamp of its 
corresponding insertion. Though we briefly study these variant languages towards the end of this paper, we 
consider the languages without auxiliary information to be more natural from a theoretical point of view, 
and more applicable from a practical point of view. 

We present new algorithms for these basic memory checking languages: we show that PQ, STACK, and 
DEQUE can each be recognized in 0(y/N) space and one pass. On the lower bound side, Theorem [T7T] and 
MULTI-AI again come into play, giving Q(yN) bounds for each of these problems, even allowing multiple 
passes over the transcript. We observe that our upper bound for PQ strengthens the 0(y/N) bound of Chu et 
al. (H for PQ-TS. This strengthening is significant, for timestamps can radically simplify problems: we note 
that STACK-TS can be recognized in just 0(log N) space, in marked contrast to STACK. 
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Highlights. Since we view Theorem 11.11 as our most important technical contribution, we first give a 
careful exposition of its proof, in Section |2l The main technical hurdle in this proof is dealing with the 
fact that Alice and Bob share some of the input, which breaks the useful "rectangle property." (This is 
reminiscent of number-on-the-forehead communication Q, where input sharing makes strong lower bounds 
rather hard to prove.) The highlight of our proof is the Fat Transcript Lemma (Lemma I2T61 ). with its careful 
interplay between a suitably weakened rectangle property (Lemma [231) and the information cost measure. 

After a brief discussion (Section [3]) of the direct sum theorem and its implications for MULTI-AI, we 
address language recognition, in Section HI The highlight of this section is our algorithm for recognizing 
the language PQ. The ingenuity of the algorithm is that, rather than determining whether the interaction 
sequence is valid directly, the algorithm conceptually reorders inserts and extracts (in addition to some 
actual "local" reordering) in such a way that the new sequence is valid if and only if the original sequence 
is valid. This reordering procedure is designed such that small-space fingerprinting methods can be used to 
capture the state of the priority queue in a way they could not necessarily have been used for the original 
sequence. 

2 Augmented Index and an Information Cost Tradeoff 

Let Al = Al n (short for AUGMENTED-INDEX) denote the communication problem where Alice receives a 
string x G {0, 1}™, Bob receives an index k G [n], the length-(A: — 1) prefix of x, which we denote x± : k-i, 
and a check bit c G {0, 1}, and the goal is to output Al(x, k, c) := x^ © c, i.e., to output 1 iff x^ ^ c. 

We now formalize the notion of information cost. For this, we consider the most general model of 
randomization in communication protocols: the parties may share a public coin, and separately, each party 
may have its own private coin. Let P be such a randomized protocol for Al, let £ be a distribution on 
{0, l} n x [n] x {0, 1} (effectively, a distribution on legal inputs to P) and let (X, K, C) ~ £. Let R denote 
the public random string used by P, and let T denote the transcript of messages sent by Alice and Bob 
(including the final output bit) in response to this random input (X, K, C): note that, in general, T depends 
on X, K, C, R and the (unnamed) private random strings of the players. We define the information cost of 
P under £ to be a pair of real numbers (icost^(P), icost^(P)) defined as follows: 

icostf (P) := l(T:X\ K, C, R) ; icostf (P) := I(T :K,C\X,R). (1) 

In the above definition, the conditioning on R is crucial, for otherwise it is simple to make these costs 
equal zero. It follows from the basics of information theory that, regardless of the choice of £, these costs 
are bounded from above by the number of bits communicated by Alice and Bob, respectively, in P. Thus, a 
tradeoff lower bound on information cost is a stronger statement than a similar tradeoff on numbers of bits 
communicated. We now turn to the choice of input distribution. 

Definition 2.1. We let fi denote the uniform distribution on {0, l} n x [n] x {0, 1}. For (X, K, C) ~ fj,, 
we let mo := A* | (X K = C). Note that E^Alpf, K, C)} = \, whereas E M0 [Al(X, K, C)] = 0. Thus, 
intuitively, \i is a hard distribution for Al, whereas is an easv distribution. 

We are now ready to state our main theorem. But first, we give a technical lemma that is useful in 
formalizing some averaging arguments in its proof. 

Lemma 2.2. Consider functions fi, . . . , Jl '■ D — >■ M + , and numbers b\, . . . ,bi G M + , where L > is an 
integer and D is a finite domain. Let Z be a random variable taking values in D. Then 

Vi G [L] E[fi(Z)\ <bi 3 z G D Vi G [L] fi{z) < Lk . 
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Proof. Choose z to minimize g(z) := Yli b >o /«( z )/^«> anc ^ observe that E[g(Z)] < L, so that g(z) < L. 
Now pick any i G [L]. If 6j = 0, then clearly fi(z) = 0. Else, fi(z)/bi < g(z) < L. □ 

Theorem 2.3 (Main Theorem; formal version of Theorem ll.il) . If P is a randomized protocol for AI n with 
error at most 1/log 2 n under \x, then either icost^ (P) = O(n) or icost^ (P) = 0(1). In particular, the 
same tradeoff holds if P has worst case two-sided error at most 1 / log 2 n. 

Proof. We split this proof into two parts. First, assuming the contrary, we zoom in on a specific setting of 
the public random string of P and a single transcript that has certain "fatness" properties that play a role 
analogous to the "large rectangles" seen in elementary communication complexity. This part of the proof is 
reminiscent of arguments in Patra§cu's proof of the lopsided set disjointness lower bound lfT8l . Next, and 
more interestingly, we use these fatness properties to derive a contradiction, in Lemma 1231 Throughout the 
proof, and the rest of this section, we tacitly assume that n is large enough. 

Assume, to the contrary, that for every choice of constants 5\ and 82, there exists a (1/ log 2 n)-error 
protocol P* for Al with icost^ (P*) < 5\n and icost^ Q (P*) < 62- To write these conditions formally, let 
T* denote the transcript of P* (which uses a public random string R) on input (X, K, C) ~ /x; we will 
condition on Xk = C when necessary, to effectively change the input distribution to p,Q. We adopt the 
convention that a transcript, t, also specifies its final output bit, out(i). We then have 

Pr[out(r*) ^ M(X,K,C)] < l/log 2 n, 
I(T* : X I K,C,R,X K = C) < 6m , and 
I(T* : K, C I X, R, X K = C) < 5 2 . 

These three inequalities can be interpreted as bounding the expectations of three non-negative functions 
of the random string R. Any particular setting of R reduces P* to a private-coin protocol. Thus, applying 
Lemma I2T21 to these three inequalities, we see that there exists a private-coin protocol P for Al such that, if 
T denotes the transcript of P on input (X, K, C) ~ fj,, then 

Pr[out(T) ^ Al(X,K,C)] < 3/ log 2 ™, (2) 
l(T:X\ K, C, X K =C) < 35 in , and (3) 
I(T : K, C I X, X K = C) < 35 2 . (4) 

Notice that H(X | K,C,X K = C) = n - 1 and R(K,C | X,X K = C) = logn. Thus, by the 
characterization of mutual information in terms of entropy, we can rewrite (O and (0]) as 

n- 1-H(X I T,K, C, X K = C) < 3Sm, and (5) 
log 11 - R(K, C \T,X, X K = C) < 3<5 2 . (6) 

Definition 2.4. Let v denote the distribution of T and let Uq := v \ (Xk = C). For a specific transcript t, 
let p t denote the distribution fi \ (T = t). 

We can interpret ((5]) and (O as bounding the expectations of appropriate functions of a random transcript 
distributed according to uq. Inequality (0, though, is not of this form, since there is no conditioning on 
(Xk = C) ; instead, it says 

E T ^ U [Pr {x >,K>,c>)~PT t° ut ( T ) + AI(X',K', C')\] < 3/ log 2 n . (7) 

Since we have Pr[JTft- = C] = |, every transcript t satisfies vo(t) < 2v(t). Thus, switching the distribution 
in the outer expectation from u to v$ can at most double the left-hand side. In other words, we have 

E To ~, [Pr ( x',tf',C')~PT iont(T ) ± Al(X> , K>, C')]\ < 6/ log 2 n . (8) 
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Finally, we can say that transcripts drawn from uq typically output "0", because 



Pr To ^ [out(To) + 0] = Pr[out(T) jt Al(X, K, C)\X K = C] < 6/ log 2 n , (9) 

where the final step uses ©. By another averaging argument, applying Lemma l2~2l to the four inequali- 
ties ([5]), ©, © and (©, we conclude that there exists a transcript t such that 

n-l-R(X | K,C,X K = C,T = t) < 128m, 

logn-R(K,C \ X,X K = C,T = t) < 125 2 , 

Pr ( x',^,c')~Pt[out(t) + Al(X,K,C)] < 24/ log 2 n, and 

out(t) = 0. 



However, by the Fat Transcript Lemma (Lemma |2T6I > below, it follows that no transcript can simultaneously 
satisfy the above four conditions. This completes the proof. □ 

At this point, we need to understand what is special about the distributions pt (from Definition I2.4I ). 
given that they arise from transcripts of private-coin communication protocols. The key fact we need here is 
the so-called rectangle property of deterministic communication protocols lfl4l Ch. 1]. More specifically, we 
need its extension to private-coin randomized protocols, as used, e.g., by Bar-Yossef et al. (4| Lemma 6.7]. 

However, there is a complication here due to the fact that Alice and Bob share some information. Had 
Bob not received any part of Alice's input, p t would have been a product of a distribution on values of x, 
and another distribution on values of (k,c). But because Bob does, in fact, start out knowing x± : k-i, we 
can only draw the weaker conclusion given in the following lemma. 

Lemma 2.5. Let X = {0, l} n and y = {(w, k, c) G {0, 1}* x [n] x {0, 1} : \w\ = k - 1}. Let P be 
a private-coin protocol in which Alice receives a string x G X while Boh receives (w,k,c) G y, with the 
promise that w = x\-k-i- Then, for every transcript t of P, there exist functions pAt '■ X — > R + and 
PB,t '■ y — > such that 

V 0, k, c) G {0, l} n x [n] x {0, 1} : p t (x, k, c) = p A ,t( x ) ' PB,t(?[l ..k-l],k,c). 

Proof. Let T be the set of all possible transcripts of P and let T be a random transcript of P on input 
(X, K, C) ~ p. By the rectangle property for private-coin protocols (Lemma 6.7 of [4]), there exist map- 
pings q A : T x X -> M+ and q B : T x^->K+ such that 

Pr[T = t\ (X, K, C) = (x, k, c)] = q A (t; x) ■ q B (t; x 1:k -i, k, c) . 

Recall that p is just a uniform distribution. In particular, it decomposes as p(x,k,c) = pA{x)pB{k,c). 
Thus, by Bayes' Theorem, 

p(x, k, c) • P r[T = t\(X, K, C) = (x, k, c)] 
pt{x,k,c) = 



Pr[T = t] 

PA(x)-pB(k,c)-qA(t;x)-q B (t;xi:k-i,k,c) 



Pr[T = t] 

Now setpA,t(x) := pa{x) ■ q A (t; x) / Pr[T = t] and PB,t(w, k, c) := pB(k,c) ■ q B (t;w,k,c). □ 



We now state the promised lemma that, as shown above, finishes the proof of Theorem 12.31 We alert 
the reader that, from here on, the distribution of (X, K, C) is no longer uniform; instead, we condition the 
uniform distribution on a specific transcript. 
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Lemma 2.6 (Fat Transcript Lemma). There exist positive real constants 5 3 and 64 such that, for every 
transcript t of a private-coin communication protocol for A I, with out(t) = 0, we have the following. Let 
(X, K, C) ~ pt and let e(n) = 24/ log 2 n. Then the following conditions do not hold simultaneously: 

R(X I K,C,X K = C) > (l-6 3 )n, (10) 
H(ET, C I X, X K = C) > logn-<5 4 , (11) 
E[ai(X, K,C)] < e(n). (12) 

Proof. Suppose, to the contrary, that (fTOl . (fTTT t and (fl2l do hold for every choice of £3 and 64. Since 
C is determined by X and K whenever the condition Xk, = C holds, the left-hand side of (fTTT) equals 
R(K I X, X K = C). Also, C21) is equivalent to Vv[X K = C] > 1 - e(n). Thus, we can simplify £[]} to 

H(K|X) > Pr[XA' = C] • H(lf | X, Xk = C) > (1 - e(n))(log n - 5 4 ) > logn-2<5 4 , (13) 

where the last step uses the bound e(n) = o(l/ log n). Similarly, we can simplify (flOl to 

H(X) > R(X \K,C)> Pr[X K = C] • R{X | K,C,X K =C)> (1 - e{n)){l - 6 3 )n > (1 -2S 3 )n. 

(14) 

We now expand (fT2)) . In what follows, we use notation of the form "uOu" to denote the concatenation 
of the string u, the length- 1 string "0", and the string v. 

n 

E[ai(X,K,C)] = E E Pt(^M-Al(x,M 

fc=ixe{o,i} n ce{o,i} 

n 

= E E E E E Pt(ubv,k,c) ■ Al(ubv,k,c) . (15) 

fc=l uGjO,!}*- 1 be{0,l} i;G{0,l} n - fc cS{0,l} 



Let p,4 = pa,* and = PB,t be the functions given by Lemma 1231 Let A denote the distribution of 
(X, K), i.e., let \(x, k) = pt{x, k, 0) + pt{x, k, 1). Now, noting that Al(ubv, k,c) = 1 iff b 7^ c, we can 
manipulate (PT5T ) as follows. 



E[ai(X,K,C)} = E E E (M«0u,M)+M«l«>M) 
fc=i ue{o,i} fc - 1 ^elo.i}™-* 

n 

= ^ ^2 E (pa(u0v) -pb(u, k, 1) +pa,(u1v) -p B (u,k,0) 

fc=l ug{0,l} fe - 1 uG{0,l} n - fc 

/ 

= ^ fp B (it,fc,l) Pa(^Ov) + p B (u,k,0) ^2 PA{uli 

k=l uG{0,l} fe - 1 V fG{0,l} n - fe fG{0,l}"- fc 

> E E (rb(u, fc, 0) A;, 1)J • min < ^ p A (u0v), ^ 

fc=l uG{0,l} fc - 1 [?JG{0,l} n -* : ve{0,l} n ~ k 

= E E min l E % to >*0, E A:) I . (16) 

fc=l u£{0,l} fc - 1 [?;G{0,l} n -* : i>G{0,l} n - fc J 

Let a : {0, l} n -> [0, 1] and /3 : [n] -> [0, 1] be the marginals of A, i.e., a(x) := ££ =1 A(x, jfc) and 
:= X^e-fo i} n A ( x ' We now make the following crucial observations about these distributions. 
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Claim 2.7. We have ||A - a ® P\\ x = E ae {o,l}" £*=l l A ( x > fc ) " <*(.*) PW\ < ^( 41n2 ) ■ <J 4 . 

Proof. Using the characterization of mutual information in terms of Kullback-Leibler divergence, we get 

B KL (\\\a®P) = l(K:X) = H(K) — K(K \ X) < 26 4 , 

where the last step uses (PT3T ) and the basic fact that H(K) < logn. The claim now follows from Pinsker's 
inequality (for which see, e.g., [9, Lemma 12.6.1]). □ 



Claim 2.8. We have Y£=i \P(k) - l/n| < ^/(41n2) • <y 4 . 

Proof. Relax ( fT3l ) to H(i^) > logn — 2^4. Let 7 denote the uniform distribution on [n]. Then we have 
Dkl(/3 II 7) = logn — H(ET) < 25^. Now apply Pinsker's inequality. □ 



Let 5s := y 7 (4 In 2) • 64. Using Claim f2J\ to estimate the expression ([TBI ), keeping in mind that any 
particular \{x,k) term appears at most once in the summation, we get 

E[Al(X, K, C)] > jr/3(k) min { Yl a ( u0v )> Yl a(ulv)\-5 5 . (17) 

k=i «e{o,i} fe - 1 ve{o,i} n ~ k ve{o,i} n ~ k 

For each k G [n], define the probability distribution a k on {0, l} fc_1 by a k (u) := ^^gjo nn-k+i a(uw) ■ 
Pr[Xi : k-i = u]. Let : [0, 1] — >■ [0, 1] denote the binary entropy function, i.e., Hb(z) := — z log z — (l — 
z) log(l - z). Let IL7 1 : [0, 1] -> [0, |] denote the (well-defined) inverse of this function. Observe that, if 
Z is a binary random variable, then min{Pr[Z = 0], Pr[Z = 1]} = H ; J" 1 (H(Z)). Using all this, we obtain 



mm 



< ^2 a(u0v), a(ulv)> = a k (u) -mini 

^- „ . s- ( r\ 1 T n. — k. ~.r~ f r\ 1 "1 n — k * V 
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a fc+ i(u0) a fc +i(«l) 



^e{o,i}"- fc i>e{o,i} 



a k (u)-K- 1 (R(X k \X 1:k _ 1 = u)). (18) 



Plugging this back into ( flTT ), we obtain 

n 

E[AI(X, tf, C)] + 5 5 > P( k ) S " fc ^ " ( H ( Xfe I Xl: fc - x = u ) 

fe=l Mejo,!}*- 1 

> /?(£;) -H^ 1 I dfe(u) • R(X k I X 1:fe _! = u) J (19) 

fc=l Vuelo,!}*" 1 



n 



k=l \k=l 



^^■Hj 1 Hft|I llW ) > H- 1 ^/3(A;)-H(Jf fc | X 1:fc _!) 



(20) 



^ H ^ (E £ • H ^ I " *1 = H ^ " , (2D 

where ( fl"9l ) and (1201 follow from Jensen's inequality (and the convexity of H^ 1 ) and (|2TI ) uses Claim l2T8l and 
the fact that IL7 1 is increasing on [0,1]. We now invoke (fl~4l) and (fT2l) to obtain 

e(n) + 5 5 > H- 1 (l-2 < 53-<5 5 ). 



Recall that £5 = y 7 (4 In 2) • £4. By choosing #3 and £4 small enough, we can make the left-hand side of the 
above inequality approach and the right-hand side approach ~, and we finally have our contradiction. □ 
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3 A Direct Sum Argument 



Let MULTI-Al m) „ denote the following communication problem, involving 2m players A\, B\, ... , A m , B m . 
Each Ai receives a string x % E {0, l} n and each Bi receives an integer k l € [n], a bit c % G {0, 1}, and the 
length-(A; 4 — 1) prefix xl, ki _, of x l . The players wish to compute the predicate V/2=i Al n(% 1 , k l , c l ). The 
players may use private random strings and a common public random string, and use p rounds, where each 
round consists of a player sending an s-bit message privately to the next player in the following sequence: 

Ai -)■ Bi -> A 2 -)• £ 2 -> -> A m -> £ m -> A m -> ^ m _i ->• . 

At the end of these p rounds, Ai must announce the answer, which is required to be correct with probability 
at least (1 — e) on each possible input. Call such a protocol a [p, s, e]-protocol. We then have: 

Theorem 3.1. Every [p, s, 1/ log 2 n]-protocol for MULTI-Al m n satisfies ps = O (min{m, n}). 

This theorem is easily seen to be near-optimal: even with p = 1, we have a trivial protocol achieving 
s = 0{n) and another trivial protocol achieving s = 0{m log n). 

Notice that the augmented index problem studied in Section |2] satisfies AI n = MULTI-Ali n . Intu- 
itively, a protocol for MULTI-Al m n must solve m independent AI instances, and thus, must use about m 
times the communication that a single instance requires. To prove Theorem 13. II we formalize this intuition 
as a direct sum theorem, which we can prove using a suitable refinement of the information complexity 
paradigm [6]. To state this direct sum theorem, we need a suitable notion of information cost for proto- 
cols solving MULTI-AI. Let Q be a [p, s, e] -protocol for MULTI-Al m n . Let £ be a distribution on inputs 
to Q and let M m denote the sequence of messages sent by player B m when Q is run on a random input 
{(X 1 , K l , C*))™ 1 ~ £, using a public random string R. We strategically define the information cost of Q 
under £ to be 

icost 5 (Q) := I(M m : if 1 , C 1 , . . . , iT m , C m | X l ,...,X m ,R) . (22) 

It is worth noting that when m = 1, i.e., we are considering a protocol for AI, this definition specializes to 
that of icost^(Q) in (Q]). This is proved in Lemma [Aj] 

Theorem 3.2 (Direct sum theorem for Ai). Suppose there exists a [p, s, e]-protocol Q for MULTI-Al m n . 
Then there exists an e-error randomized protocol P for AI in which Alice sends at most ps bits in total, such 
that m ■ icost^ (P) < icost^®™ (Q) , where fiQ is as in Definition 12. 1 \ and ^jf™ denotes the m-fold product 
of jjlq with itself. 

Proof. This is a straightforward generalization, to multiple rounds, of a similar theorem of Magniez et 
al. Ifl5l . which applied only to restricted families of one-round protocols. Details appear in Appendix[A] □ 

We can now prove our multi-round communication lower bound on MULTI-AI as follows. 

Proof of Theorem \3.1\ Let Q be a [p, s, 1/ log 2 n]-protocol for MULTI-Al mn . From basic information the- 
ory, it follows that icost^®m (Q) < ps. Now, by Theorem 13.21 there exists a protocol P for AI with 

icost^ Q (P) < ps/m and in which Alice communicates at most ps bits, so that icost^ (P) < ps. By 
Theorem 12.3 1 either ps/m = 0(1) or ps = Q(n); i.e., ps = f2 (min{m, n}). □ 

4 Streaming Language Recognition and Passive Memory Checking 

In this section we present our results for recognizing certain languages in the data stream model. Of partic- 
ular interest is DYCK(2), the language consisting of the strings of well-balanced parentheses in two types of 
parentheses. Formally, representing '(*, ')', '[', and ']' as a, a, b, and b respectively, 
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Definition 4.1. DYCK(2) is the language generated by the context-free grammar S — > aSa \ bSb \ SS \ e. 

An important class of memory checking problems, which we call passive checking, can also be viewed 
as language recognition problems in the data stream model. For example, we define PQ to be the language 
corresponding to transcripts of operations, or "interaction sequences," of a priority queue that begins and 
ends with an empty queue. (Without this restriction, the resulting language would require Q(N) space to 
recognize, for simple reasons [8, Theorem 4].) Formally, 

Definition 4.2. An interaction sequence a = a\(J2 ■■■ o n is a string over the alphabet S = {ins(u), ext(u) : 
u G [U]}. Let PQ = PQ(U) be the language defined over £ where ins(u) is interpreted as an insertion of u 
into a priority queue, and ext(u) as an extraction of u from the priority queue. The state of the queue at any 
step j can be represented by a multiset Mj so that 

M = ; Mj = Mj-\ \ {min(M,_i)} if Oj = ext(-u) ; and Mj = Mj-x U {u} if Oj = ins(u) . 

(23) 

Then a £ PQ for \a\ = N iff M N = and Vj G [N] {aj = ext(u) => u = min(Mf_i)). 

We start by showing that a recognizer for PQ can also recognize DYCK(2) via an online transformation. 
The reduction proceeds as follows. Consider a string p over parentheses {a, a, b, b} and define 

height(p) := | {j : pj G {a, b}}\-\ {j : pj G {a, b}} \ 

and height(e) = 0. We transform p into ip(p) = 4>{pi-.i)4>{pi:2) ■ ■ ■ 4>{pi-.n) where: 



ins(2A^ — 2 height {pv.i-i)) if Pi = a 

ext(2N — 2 height if pi = a 

ms(2N - 2 height - 1) if Pi = b 

ext(2iV- 2 height (p 1:i ) - 1) if pi = b 



For example, the string (a,a,a,b,b,a) is transformed into (ins(12), ins(10), ext(10), ins(9), ext(9), 
ext(12)). The proof that ip(p) G PQ if and only if p G DYCK(2) is given in AppendixE 

Lemma 4.3. There exists an 0(log N)-space stream reduction from DYCK(2) to PQ(4N). 

Our first result on the complexity of stream language recognition uses Theorem 13. II to resolve the con- 
jecture of Magniez, et al. |[T5l regarding the multi-pass complexity of DYCK(2) and PQ. 

Theorem 4.4 (Multi-pass Lower Bounds for DYCK and PQ). Let L denote either DYCK(2) or PQ_(N). Sup- 
pose there exists a 0(1/ log 2 N)-error, p-pass, s-space, randomized streaming algorithm that recognizes L 
on length-N streams. Then ps = Vl(\fN). 

Proof. Using the reduction of Magniez, et al. lfl5l . an e-error p-pass randomized streaming algorithm for 
DYCK(2) that uses s bits of space on streams of length Q(mn) can be turned into a \p, s, e]-protocol for 
MULTI-Al m n . One can similarly reduce MULTI-Al m n to PQ(A r ); this was implicitly claimed without proof 
in ITT31 . Alternatively, Lemma FOI gives an explicit reduction from MULTI-Al m n to PQ via DYCK(2). To 
complete the proof, we combine these reductions with Theorem 13. II setting m = n. □ 

Unidirectional versus Bidirectional Passes. As noted earlier, DYCK(2) can be recognized in 0(log 2 N) 
space using two passes, one in each direction. On the other hand, the above theorem implies that achieving 
polylog(n) space with only unidirectional access to the input would require tl(yN) passes. To the best of 
our knowledge, this is the first explicit demonstration of such a strong separation between these two natural 
data stream models. 
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Algorithm 1 PQ-CHECK 

1: input a = a El a h a E2 a h . . . a Er a Ir where a El = a Ir = 

2: for k G {1, . . . , r}, u G {1, . . . , U}, do /[jfe] <- 0, X[k, u] «- 0, y [jfe, u] «- 0, Z[Jfe, u] <- 

3: for i G {1, . . . ,r} do 

4: for ext(u) G do 

5: £ ^— min{/c : /[&] < u} 

6: y [£,«]<- y [£,u\ + i 

7: <-max(y [£,u], Z[£,u]) 

8: for 1 < k < i do f[k] <- max(it, /[fc]) 

9: end for 

10: for ins(u) G <r /l do 

11: £ <- min{A; : /[A;] < u} 

12: if /[^ < n then X[£, u] f- X[^, u] + 1 

13: if f[£] = u then u] <-Y[£,u]-l 

14: end for 

15: end for 

16: if X / Z or X / y then reject else accept 



4.1 Passive Checking of Priority Queues 

Given the connection between PQ and DYCK(2) shown in Lemma |4~3l one might hope to adapt the algo- 
rithms of lfl"5ll to this problem. However, there seems to be no such easy reduction in this direction. For 
intuition, observe that DYCK(2) has a much stricter requirement on the permitted strings: if its second half 
consists of close-parentheses only, then its first half is uniquely determined. On the other hand, in PQ, one 
can find (N/2)\ sequences consisting of N/2 insertions followed by N/2 extractions that all agree on the 
second half. This suggests that the two languages are quite different. 

We therefore give a novel algorithm that leads to the following theorem, which is the main upper bound 
result in this paper. 

Theorem 4.5. We can recognize the language PQ in one pass, using 0(y/N (log U + log N)) bits of space: 
an input a £ PQ is accepted with certainty, and an input a PQ is rejected with probability > 1 — 1 /N 2 . 

Overview of the Algorithm. We first present a 0(Ur (log U + log N)) space algorithm for the case when 
the input string can be decomposed as a = a El a Il a E2 a l2 . . . a Er a Ir where a Ei is a sequence of extracts 
and a li is a sequence of inserts. We refer to a Ei a I{ as the ith epoch of the string and note that, for sufficiently 
large r, any a is of this form. After presenting the full space algorithm, we show how to transform a such that 
r = 0{yJ~N) and subsequently, to reduce the space to 0(yN). Finally, a necessary condition for a G PQ 
is that the extracts in each a Ei are in ascending order and that a El = a Ir = 0. Since both conditions are 
easily verified, we assume that they are satisfied. 

We present the algorithm PQ-CHECK as AlgorithmQ] We first describe its properties informally, before 
proceeding to a more formal analysis. 

1. For each epoch k, PQ-CHECK maintains a value f[k] that is the maximum value that has been extracted 
after the kth epoch. In particular, at the very start of the ith epoch, / [i — 1] = 0. 

2. Each insert/extract of u is assigned to the earliest epoch "consistent" with the current / values maintained 
by PQ-CHECK, i.e., £ = minjfc : f[k] < u}. Each ext(u) G o~ Ei is assigned to an epoch between 1 and 
% — 1 (this follows because the extracts in o Ei are in increasing order and f[i— 1] equals when the 
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first extract in a * is processed), while each ins(u) G a 1 * is assigned to an epoch between 1 and i. 
Importantly, for a G PQ, each ext(u) will be assigned to the same epoch as the most recent ins(u). 

3. The algorithm maintains arrays X, Y, and Z to track information about occurrences of item u assigned 
to epoch k (we later use hashing techniques to reduce the size of this information). Informally, X tracks 
the number of insertions of u assigned to epoch k before the first extraction of u that is assigned to epoch 
k, while Y tracks the number of extractions of u assigned to epoch k minus the number of insertions of 
u assigned to epoch k from the first extraction of u assigned to epoch k onwards. A necessary condition 
is that these two counts should agree. However, this counting alone fails to detect extractions of u that 
appear before the corresponding insertions. Therefore, Z is used to identify the maximum "balance" of 
u during epoch k. This should also match X if the sequence is correct, and we later show that these are 
sufficient conditions to check membership in PQ. 

Define f t (k) = max{u : cij = ext(u), \a El . . . a Ik | + 1 < i < t}. For u € [U] and t G [N], define 
b(t, u) = min{fc : ft(k) < u}. Given an interaction sequence a and u G [U], define 

cnt(a, u) := \{t : at = ins(u)}| — \ {t : a t = ext(u)}| . 

Lemma 4.6. After processing the tth element, Algorithm\l\has computed f[k] = ft(k), i.e., the maximum 
value extracted after the end of the kth epoch. For all k, f[k] is non-decreasing as t increases. 

Proof. Observe that Algorithm Q] only updates f[k] in Line[U for k < i where the current epoch is the ith 
epoch. The equivalence of f[k] and ft(k) follows immediately by an inductive argument over t. f[k] = 
ft(k) is seen to be non-decreasing by inspection of the definition of ft{k). □ 

Lemma 4.7. Let Xt(k,u),Yt(k,u) and Zt(k,u) denote the values of X[k, u], Y[k, u], and Z[k,u] after 
processing the tth element. Assume that the first t elements of the interaction sequence are a prefix of some 
interaction sequence in PQ, i.e., for all j G [t], (o~j = ext(f) =^ v = min(Mj_i)) where {Mj}jL is the 
family of multisets defined in Eq. (123b - Then, for any u G [U] and k = b(t, u), we have: 

cnt(cri :t ,«) = X t (k,u) - Y t (k,u) 

and for k < b(t,u), X t (k,u) = Y t (k,u). 

Proof. Let u G [U] be an arbitrary element. We proceed by induction on t. The lemma is true for t = 
where Xo(k, u) = Yo(k, u) = for all k, u. For the inductive step with k = b(t — l,u), there are four cases 
to consider: 

1. Case at = ins(u): In this case b(t — 1, u) = b(t, u) = k. Therefore, 

cnt(<7i;t,it) = cnt(<Ti :t _i,n) + 1 = 1 + X t -\{k,u) - Y t ^i(k,u) = X t (k,u) - Y t (k,u) 
The last step follows whether or not ft(b(t — l,u)) = u (lines [12] and [13] in Algorithm [T]). 

2. Case a t = ext(-u): In this case b(t — 1, u) = b(t, u) = k. Therefore, 

cnt(o-i:t,n) = cnt(<7i:t_i,u) - 1 = X t -i(k,u) - (Y t -i(k,u) + 1) = X t (k,u) - Y t (k,u) 

3. Case a t = ins(u) for v / u or at = ext(u) for v < u: In this case b(t — l,u) = b(t,u) = k. 
Therefore, 

cnt(<7i :t ,u) = cnt(<7i :t _i,u) = X t -i(k,u) - Y t -i(k,u) = X t (k,u) - Y t (k,u) 
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4. Case at = ext(v) for u < v. In this case we know that cnt(o"i :t _i, u) = 0. Assume it was not: 
then either there is a minimal prefix of a for some j such that cnt(o"i : j, u) < which implies that 
<Tj = ext. (it) but it / min(M 7 _i); or else cnt(o"i : t-i, u) > which implies that v ^ mm(M t -i) 
since min(M t _i) < u < v. Either way, we contradict our assumption on a. Therefore, 

cnt(ai;t,u) = cnt(cTi : t_i,ii) = X t -i(b(t - 1, u), u) - Y t -i(b(t - 1, u), u) 

= X t (b(t-l,u),u)-Y t (b(t-l,u),u) 

If b(t — 1, u) = b(t,u) we are done but it is possible that b(t — 1, u) / b(t,u). This is because 
following this extraction, for all 1 < £ < i, we set f[£] to max(/[£], v) > u which forces b(t, u) = i, 
where i is the current epoch. But then X t (b(t, u),u) = Y t (b(t, u), u) = since no inserts or extracts 
of u can yet have been associated with epoch i. Hence, even if b(t — 1, u) / b(t, u), cnt(<7i : t , u) = 
X t (k, u) — Y t (k, u) for k = b(t, u). 

In all cases, for k < b(t — 1, u), we observe that Algorithm Q] does not modify X[k, u] or Y[k, u] and these 
are already equal by the induction hypothesis. If k = b(t — l,u) < b(t,u), then, as reasoned in case 4 
above, we have X t (k, u) = Y t (k, u) as required. □ 

Theorem 4.8. If a ^ PQ, AlgorithmUlrejects, else it accepts. 

Proof. If a ^ PQ, consider the minimum t such that a t = ext(u) and u ^ min(M t _i). Let k = b{t — 
1, it). There are two possibilities. First, suppose u M t -\. Then, by Lemma 14771 before processing at, 
X t -\(k, u) — Y t -\{k, u) = 0. After processing at we have Y t (k, u) = Y t -\(k, u) + 1. Hence, 

Z t (k,u) > Y t (k,u) > X t {k,u). 

Since Z s (k,u) is non-decreasing in s and X s (k,u) = X t (k,u) for s > t after f{k) becomes equal to u, 
at the end of the algorithm Z^^k^u) ^ Xjv(A;,u). Hence the algorithm rejects a. Otherwise, suppose 
u € M t -\ but min(M t _i) = v ^ u. Then cnt(cri : t_i, v) > 0. Let k = b{t — l,v) and by Lemma 
14.71 X t -\(k, v) — Y t -i(k, v) > 0. Once ext(u) is processed, f[k] is increased to u and hence X s (k, v) > 
Y s (k, v) for all s > t, and the algorithm rejects. 

If a £ PQ, then by Lemma 14771 at t = N, X t (k,u) — Y t (k,u) = for all u, k. Consequently, 
Zt(k, u) > Y t (k, u) = X t (k, u) for all k, u. Since cnt(<7t, u) > for any a £ PQ, Y t (k, u) < X t (k, u) for 
all t. Hence Z t (k, u) < X t (k, u) and so Xjy = Y/v = Zn and the algorithm accepts. □ 

Local Consistency. We now consider a substring a' of a and show that if it does not violate some local 
conditions, then without loss of generality it can be assumed to be in a specific form. 

Definition 4.9. We say a 1 is locally consistent if both 

1. Vi < k,u < v : {a[= ins(u)) A (a' k = ext(v)) =^ (cnt(a' i+l . k _ l ,u) < 0). 

2. Vi < k,u > v : {a[ = ext(u)) A (a' k = ext(v)) =^ (cnt(<7^ +1 . k _ 1 , v) > 0). 

Observe that if a' is not locally consistent, then a ^ PQ, since the identified subsequence includes an 
extraction of an item which cannot be the smallest in the priority queue. 

Lemma 4.10. Given a = a p a'a . If a' is locally consistent, then there exists a mapping j(a') = 
a a a h a c a d such that a prei a'a mS G PQ iff a wef j(a')a mS G PQ. Here, a a and a c are both sequences of 
extracts in increasing order; and a b and a d are both sequences of inserts. The algorithm SUB -CHECK tests 
if a' is locally consistent and, if so, computes 7(0"') in time 0{\a'\ log \a'\). 
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Algorithm 2 SUB-CHECK 

l: input a' 

2: / <- 0; w <- 0; £ <- 0; I <- {oo} 

3: fori G [|a'|] do 

4: if cr^ = ins (it) then I -<— I U {u} 

5: if cjj' = ext(v) then 

6: m <— min(J) 

7: if (u > m) then reject 

8: if (v = m) then I <— I \ {v}; w <— max(w, v) 

9: if (v < m) then 

10: if w < max(/, then reject 

11: f^v;E^EU{v} 

12: end if 

13: end if 

14: end for 

15: output (ext(«i), . . . , ext(v\E\), ins(w), ext(w), ins(«i), . . . , ins(tt|/|)} where V{ and m are the ith 
smallest values of E and / respectively 



Proof. We first define the mapping 7 procedurally based on local rearrangements of the locally consistent 
a' which maintain local consistency. First consider all adjacent character pairs of the form ins(u), ext(v). 
Since the string is locally consistent, u > v. Whenever u > v, we interchange these characters to obtain 
ext(w), ins(u), without affecting local consistency or membership in PQ. Hence, we may assume that for 
every two adjacent characters ins(it), ext(v ), we have u = v, i.e., the pair represents an insertion followed 
immediately by an extraction of the same item. This generates a string a(a') that satisfies a p a(a')a 6 
PQ iff o-P re W uff G PQ. 

We next define two rearrangement rules on substrings of length three in a(a'). If applied to a string that 
was not locally consistent, they could "fix" errors, and lead to strings which are in PQ; however, since they 
are applied to locally consistent strings, the rearrangement preserves membership in PQ. 

1. ins(u) ext(it) ext(u) — > ext(f;) ins(u) ext(it). 

2. ins(u) ins(u) ext(n) — > ins(u) ext(tt) ins(u). 

By repeatedly applying these rearrangement rules to a(a') until no further rearrangement is possible we 
obtain j3(a'). Define the potential function over interaction sequences as <&(<r) = S<7 l= ext(«) Observe 
that each rearrangement reduces $ by 1, so the process terminates. Let fi{a') denote the final permutation 
and note that a pvef /3(a')a suG G PQ iff fj pre W suff G PQ. Then, for some ti,t 2 , t 3 , /3(<r') has the form, 

(ext(«i), . . . ,ext(v tl ),m.s(w{), ext(ioi), ins(tw 2 ), ext^), • • • ,'ms(wt 2 ),ext(w t2 ),'ms(u\), . . . ,ins(«t 3 )) 

where v\ < V2 < ■ ■ ■ < v tl . For w = maxjwi, . . . , wt 2 }, define 7(0"') = a a a b a c a d where 

a a = (ext(fi), . . . , ext(utj)), a b = (ms(w)), a c = (ext(w)), and a d = (ins(iii), . . . , ins(^ 3 )) . 

Observe that o* K[ 'Y(o J )o mB G PQ iff o- pisf ^(a')a m& G PQ and <7 pre Va suff G PQ iff a pref -r(a')a suS G PQ. 

We next show that it is possible to test local consistency and compute 7(0"') directly in 0(|<r'| log \a'\) 
time. Consider SUB -CHECK in Algorithm |2] 

We first argue that Algorithm |2]correctly determines whether a' is locally consistent. First observe that I 
records the multiset of items which have been inserted in a' and not yet extracted. A violation of Condition 1 
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in Definition |4.9| is detected in line [7] where the existence of m G I with m < v indicates that an insufficient 
number of ext(m) have occurred before the ext(t> ) being considered. 

A violation of Condition 2 is detected in line[lOl this is when the current character is ext(w) but there 
was an ext(it) for u > v earlier but no subsequent ins(u) that could be matched with the current ext(u). This 
is monitored via two variables, / and w. w is the maximum value extracted that is matched to an insertion 
happening within a', f is the most recent value extracted that is not matched within a'. Observe that because 
of the test in line [lOl / is non-decreasing. Consequently, max(/, w) is the largest value extracted so far. 
If there is some u > v such that ext(it) occurs in a' before ext(f), then max(/, «i) > u > v. Hence, it 
suffices to track only the greatest extracted item in a'. We can be sure that there is no ins(u) matching the 
ext(u) since m = min(Mj) > v: if v were matched, it would be present in / and found as m. 

The algorithm computes 7 correctly: / is the multiset of items that are inserted but not extracted in a, 
and E is the multiset of items that are extracted without a matching insert in a'. As noted above, w tracks 
the greatest item which is inserted and subsequently extracted in a', so the output has the necessary form. 
Implementing I and E as priority queues means that each character is processed in 0(log |<r'|) time, giving 
total 0(\a'\ log \a'\) time and 0{\a'\) space. □ 

Consequently, by breaking a into sequential substrings of length I and reordering each substring (unless 
we determine the substring is not locally consistent) we may ensure that the interaction sequence has the 
form a = a El a h a E ' 2 a l2 . . . a Er a Ir where r = 2\N/f\. The final algorithm runs PQ-CHECK and SUB- 
CHECK in parallel. The space required by SUB-CHECK is 0(1 log U) bits and we will show that PQ-CHECK 
can be implemented in 0(r (log N + log U)) bits. Setting I = \fN yields Theorem 14. 5 i 

Finishing the Proof: A Small-Space Implementation of PQ-CHECK. Rather than maintain the arrays 
X, Y, and Z explicitly in PQ-CHECK, it suffices to keep a linear hash (which serves as a homomorphic 
fingerprint) of each array. These fingerprints can be compared, and if they match in Line[l6l then, with high 
probability, the arrays agree. In Line [7] we need to perform a max operation between two values. This can 
be done by maintaining Y[k, ft{k)] and Z[k, f t (k)] explicitly for each k. At any time, there are at most 
r such values that are needed: observe that when f t+ \(k) > ft(k), Y[k, ft(k)] and Z[k, ft(k)] are never 
subsequently altered. The new values for Y[k, f t +i(k)\ and Z[k, ft+i(k)] are initialized to 0. Hence, the 
space of the algorithm is 0(r) words to store the Y[k, f[k}], Z[k, f[k]] and f[k] values, and a constant 
number of fingerprints to represent X, Y, and Z. 



4.2 Passive Checking of Stacks, Queues, and Deques 

Stack. Let STACK denote the language over interaction sequences that conesponds to stack operations. 
Now ins(u) corresponds to an insertion of u to a stack, and ext(it) is an extraction of u from the stack. Then 
a G STACK iff a corresponds to a valid transcript of operations on a stack which starts and ends empty. 
That is, the state of the stack at any step j can be represented by a string so that S° = 0, = uS^ 1 if 
crj = ins(n) and = ^T^- 1 ! if &j = ext(tt). Then a G STACK for \a\ = N iff 

S N = and Vj G [N], (aj = ext(n) => u = S{~ 1 ) 

Theorem 4.11. Every 0(1/ log 2 N) -error, p-pass, s-space randomized streaming algorithm to recognize 
STACK on length N streams must satisfy ps = £l(y/~N). It is possible to recognize STACK in one pass with 
0{yJ~N log N) bits of space with high probability. 

Proof. First, we observe that for U = 2, DYCK({7) = STACK if we associate ins(n) with u and ext(ii) with 
u. Therefore, the lower bound follows immediately. For the upper bound, the one-pass algorithm from lfl31l 
to recognize DYCK(2) can be used to recognize STACK over arbitrary U by appealing to their reduction from 
dyck(CZ) to dyck(2). □ 
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We note that the algorithm of |[T5l for recognizing DYCK(2) can be used directly to recognize DYCK(C7) 
rather than appealing to the reduction from dyck(J7) to DYCK(2). In outline, the algorithm works as 
follows. The input string is broken into blocks of length yN. Within each block, any adjacent pair of the 
form (ins (it), ext(u)) can be matched and removed. When no further removals of pairs are possible, the 
block now has the form of a prefix of extracts followed by a suffix of inserts. The algorithm keeps a stack of 
hashes of (item, height) pairs, along with the number of items summarized in each hash. Each item extract 
in the block, along with the current height, is removed from the hash on the top of the stack, until the hash 
supposedly represents no items. If the hash is not identically zero, the algorithm rejects. Otherwise, the 
procedure proceeds to the next hash on the stack, until the prefix of extracts are exhausted. Then the inserted 
items are hashed with their current height, to build a single new hash value which is pushed onto the top of 
the stack. 

Queue. Let QUEUE denote the language over interaction sequences that correspond to queues. That is, 
the state of the queue at any step j can be represented by a string Qi so that Q° = 0, Q- 7 = Qi~ x u if 
<jj = ins(-u) and Qi = Q^gj-ii if a j = ext(-u). Then a G QUEUE for |cr| = N iff 

Q N = and Vj G [N], (aj = ext(u) =>- u = Qj -1 ) 

As observed in |5], it is possible to recognize QUEUE with a single pass and 0(log N) space: we use a 
single fingerprint to check that the value of the ith insert equals the value of the ith extract for all i G [N] . 

Deque. Let DEQUE denote the language over interaction sequences that corresponds to double-ended 
queues. That is, there are now two types of insert and extract operations, one operation for the head and 
one for the tail. Clearly, since a deque can simulate a stack via operations on the tail only, recognizing 
DEQUE is at least as hard as recognizing STACK. For the upper bound, it is possible to adapt the algorithm 
of lfl31 . Again, each block of yiV operations is partitioned into a prefix of extractions (to head and tail) and 
insertions (to head and tail). Now we maintain a deque of hash values of item, height pairs. Each extract 
to the head is applied to the hash at the head of the deque of hashes, and each extract to the tail is applied 
to the hash at the tail of the deque. The same check is applied: any hash which should now summarize no 
items must be identically zero (otherwise, the algorithm rejects). Inserts to the head are parceled up into a 
hash which is placed at the head of the deque, and inserts to the tail are placed in a hash at the tail of the 
deque. Then we accept a if after processing a the algorithm reaches an empty deque and has not rejected at 
any point. This gives the following theorem. 

Theorem 4.12. Every 0(1/ log 2 N) -error, p-pass, s-space randomized streaming algorithm to recognize 
DEQUE on length N streams must satisfy ps = £l(y/~N). It is possible to recognize DEQUE in one pass with 
0{y/~N log N) bits of space with high probability. 

4.3 Variations with timestamps 

As noted in the introduction, the results of Blum et al. HI can be viewed as recognizing languages where each 
ext(-u) is augmented with the timestamp of its matching ins(u), and is denoted ext(w, t). These languages 
are defined as before, but with the additional constraints that each t G [N] appears at most once across all 
extracts and 

Vj G [N], {aj = ext(v,t) => at = ins(u)) 

This defines the variant languages QUEUE-TS, STACK-TS, DEQUE-TS and PQ-TS. The observations of 
Blum et al. imply that verifying strings in STACK-TS and QUEUE-TS (and ensuring that all the timestamps 
are also consistent) requires only 0(logN) space. The same argument also gives an O(logiV) bound 
for deques. For PQ-TS, the problem seems harder: Chu et al. [8] gave an 0(y/~N) streaming algorithm 
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which relied heavily on the presence of timestamps (and hence does not recognize PQ without timestamps). 
We leave as an open question the problem of fully resolving the complexity of recognizing priority queue 
sequences with timestamps, since the reduction via augmented indexing no longer holds in this case. 
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A Proof of the Direct Sum Theorem 

For completeness, we give a full proof of our direct sum theorem that relates the information complexity of 
MULTI-AI with that of Al. We begin with a small technical lemma that is an interesting observation in its 
own right. 

Lemma A.l. Let P be a communication protocol involving two players, Alice and Bob, who share a public 
random string R in addition to their private random strings. Let T denote the transcript of P when Alice 
receives input X and Bob receives Y, from an arbitrary input distribution. Let A and B denote the portions 
ofT that are communicated by Alice and Bob, respectively. Then 

I(T : X | Y, R) = 1(A: X \ Y, R) and l(T : Y \ X, R) = l(B : Y | X, R) . 

Proof. By the chain rule for mutual information, we have 

I(T : X | Y, R) = l(AB : X \Y,R) = l(A : X \ Y, R) + l(B : X \ A, Y, R) . 

Since Bob's messages are just some function of A, Y, R, and his private coins, for any fixed setting of 
A, Y, R, we have that B and X are independent. Thus, l(B : X | A, Y, R) = 0. Similarly, we can show that 
I(T : Y | X, R) = I(B : Y | X, R). □ 

Theorem 13.21 (restated). Suppose there exists a \p, s , e]-protocol Q for MULTI-Al m n . Then there exists an 
e-error randomized protocol P for A\ n in which Alice sends at most ps bits in total, and which satisfies 

m ■ icost^ (P) < icost^cam (Q) , 

where //o is as in Definition \2.1\ and ii® m denotes the m-fold product of hq with itself. 

Proof. Using Q, we can derive a family, {Pj}j£[ m ], of protocols for Al, using the following simulation. 
Suppose Alice and Bob receive inputs x and (k, c, xi : k-i) respectively. 

1. Alice sets Afs input to x and Bob sets Bfs input to (k, c, 

2. The players generate X 1 , X 2 , . . . , X^ 1 , X 3+1 , . . . , X m , K 1 , . . . , K 3 ^ 1 independently and uniformly 
at random using public coins. They choose C 1 , . . . , C J_1 so that X l Kl = C l for all i £ \j — 1]. This 
sets the input to players A±,Bi, . . . , Aj—i, Bj-i and ensures that (X 1 , K l , C l ) ~ f° r ai l i < 3- 

3. Bob generates K J+ , K J+2 , . . . , K m independently and uniformly at random using private coins. He 
chooses C J+1 , . . . , C m so that X t Rl = C % for each i € {j + 1, . . . , m}. This sets the input to players 
Aj + i, Bj + i, . . . , A m , B m and ensures that (X 1 , K l , C l ) ~ f° r an i > j- 

4. The players now jointly simulate Q on the random input Z thus generated. In each round: 

(a) Alice simulates players A\, B\, . . . , Aj and sends Bob the message that Aj would have sent to 

By 

(b) Bob simulates Bj,Aj + i, . . . , B m and then sends Alice the message that B m would have sent to 
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(c) Alice then continues the simulation of A m , . . . , A\ and moves on to beginning of the next round 
(if required), without having to communicate anything. 

5. At the end of the simulation, Alice outputs the answer that player A\ would have output in Q. 

Clearly, Alice communicates at most ps bits in Pj. The definition of ensures that Al(X l , K\ C l ) = 
for all i ^ j, and therefore MULTl-Al(Z) = Al(X, K, C); thus Pj is correct whenever Q is correct on the 
randomly generated input. This bounds the worst-case error of Pj by e. To bound the information cost of Pj, 
notice that when the input to Pj is distributed according to no, it simulates Q on an input that is distributed 
according to /x® m . Let (X J \ K J C J ') denote a random input to Pj distributed according to no, and let 
T and B denote the resulting random transcript of Pj, and Bob's portion of this transcript, respectively. 
Defining M m and R as in (l22l ). we see that B = M m and that the public random string used by Pj is exactly 
R' = (R, X~j, K\ . . . , Ki- 1 ). Thus, 

icost^P,) = l(T:K^a\X^R') 
= l(B : K j ,C j | X j ,R') 

= I(M m : K j ,C j \ K 1 ,..., Ki-\X\ ...,X m ,R), 
where the second equality follows from Lemma lATTl By the chain rule for mutual information, we have 

icost^ m (Q) = I{M m : K\C l ,...,K m ,C m \ X\...,X m ,R) 

m 

= ^I(M m :Ki,C j | if\CV--,^ i ~\C i ~ 1 ,X 1 ,...,X m ,i?) 

m 

= J ( M ™ : Ki > C j \K\..., Ki~\x\ ...,X m ,R) (24) 

3=1 
in 

3=1 



where (|24l) holds because X J and K 3 completely determine C J , according to the distribution /xo- Picking j 
t^ (Pj) now gives us m ■ icost^ (Pj) < icost^®, 



to minimize icost^ o (Pj) now gives us m ■ icost£ ) (P J ) < icost ®m(Q). □ 



B Reduction from dyck(2) to PQ 

Lemma |431 (restated). There exists an 0(log N)-space stream reduction from DYCK(2) to PQ(4A^). 
Proof. Consider a string p over parentheses {a, a, b, b} and define 

height(p) := | {j : pj G {a, b}}\-\ {j : pj G {a, b}} \ 

and height(e) = 0. Define the transformation by ij)(p) = <ft(Pi:i)4'{Pi:2) ■ ■ ■ 4>(Pi:N) where: 



4>{p 



^(2^ — 2 height if pi = a 

ext(2N — 2 height (pi:*)) if Pi = a 

ins(27V - 2 height - 1) if pi = b 

ext(2iV- 2 height (p 1:i ) - 1) if pi = b 
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First, note that the transformation can be done in 0(log N) space since it is sufficient to maintain the 
height of the last two elements. The transformation is onto [4ZV], since for any arbitrary string p of N 
parentheses — N < height (pi : jv-l) < N. 

We now argue that p G DYCK(2) iff ip(p) G PQ. For notational convenience, we first define ip(p\p') = 
<fr(p'pi:i)<fr(p'pi:2) ■ ■ ■ 4>{p' Pv.\p\) and note that tp(p'p) = ip{p')4>{p\p'). 

• p G DYCK(2) implies ip(p) G PQ: We prove this by induction on the length of p. We may decompose 
p = p l ccp 2 where c G {a, b} and h = height^ 1 ) is maximal over all such decompositions. Without 
loss of generality assume c = a. Note that p 1 p 2 G DYCK(2) and hence, by induction il)(p l p 2 ) = 
i^>{p 1 )'^(p 2 \p 1 ) G PQ. But observe that 

which is in PQ because ipip^ipip 2 ^ 1 ) G PQ and tj){bb\p x ) = ms(2iV - 2h — 1) ext(2iV - 2h — 1) 
where 2N — 2h — 1 < {u : ins(n) G ^(p 1 )}. Since h is maximal, 2iV — h — 1 is indeed the smallest 
value when it is extracted. 

• p ^ DYCK(2) implies ip(p) ^ PQ. Since p ^ DYCK(2), a standard characterization of the language 
implies that one of the following cases is true: 

- Case 1. height(pi : Ar) / 0. Therefore, there are different numbers of extracts and inserts in 
ijj(p) and hence tp(p) ^ PQ, since each open parenthesis maps onto an insert and each close 
parenthesis maps onto an extract. 

- Case 2. height(pi : j) < for some i G [N]. Therefore, there are more extracts than inserts in a 
prefix of i)j{p) and hence ijj(p) ^ PQ. 

- Case 3. There exists a smallest j such that for some i < j, 

* height(pi :i _i) = height (p 1:j ) =: h 

* Pi: j-i is a prefix for a string in DYCK(2) and hence ijj(pi-j-i) is a prefix for a string in PQ 

* (Pi,Pj) = (a,b) or (pi,pj) = (b,a). 

Since ip(pi-.j-i) is a prefix for a string in PQ, we can consider the state, Mj-x, of the priority 
queue after the interaction sequence ^(px:j-i) as defined in Definition 14.21 Note that Mj_i 
contains at most one element from {2iV — 2k — 1, 2N — 2k} for each k (else j was not the 
minimal choice). If (pi,pj) = (b, a) then 2N - 2h - 1 G M,_i. But 4>(pi :j ) = ext(2N - 2h) 
and we therefore deduce that tp(p) ^ PQ. If {pi,Pj) = {a, b) then 2N — 2h G Mj_i and hence 
2N - 2h - 1 ^ Mj_i. Since j) = ext(2iV - 2/i - 1), we deduce that ^ PQ. 

□ 
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